foodborne illness
UCE-FID: Using Large Unlabeled, Medium Crowdsourced-Labeled, and Small Expert-Labeled Tweets for Foodborne Illness Detection
Hu, Ruofan, Zhang, Dongyu, Tao, Dandan, Zhang, Huayi, Feng, Hao, Rundensteiner, Elke
Foodborne illnesses significantly impact public health. Deep learning surveillance applications using social media data aim to detect early warning signals. However, labeling foodborne illness-related tweets for model training requires extensive human resources, making it challenging to collect a sufficient number of high-quality labels for tweets within a limited budget. The severe class imbalance resulting from the scarcity of foodborne illness-related tweets among the vast volume of social media further exacerbates the problem. Classifiers trained on a class-imbalanced dataset are biased towards the majority class, making accurate detection difficult. To overcome these challenges, we propose EGAL, a deep learning framework for foodborne illness detection that uses small expert-labeled tweets augmented by crowdsourced-labeled and massive unlabeled data. Specifically, by leveraging tweets labeled by experts as a reward set, EGAL learns to assign a weight of zero to incorrectly labeled tweets to mitigate their negative influence. Other tweets receive proportionate weights to counter-balance the unbalanced class distribution. Extensive experiments on real-world \textit{TWEET-FID} data show that EGAL outperforms strong baseline models across different settings, including varying expert-labeled set sizes and class imbalance ratios. A case study on a multistate outbreak of Salmonella Typhimurium infection linked to packaged salad greens demonstrates how the trained model captures relevant tweets offering valuable outbreak insights. EGAL, funded by the U.S. Department of Agriculture (USDA), has the potential to be deployed for real-time analysis of tweet streaming, contributing to foodborne illness outbreak surveillance efforts.
- North America > United States > New York (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > Nevada > Clark County > Las Vegas (0.04)
- (2 more...)
- Health & Medicine > Epidemiology (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- Food & Agriculture > Food Processing (1.00)
TWEET-FID: An Annotated Dataset for Multiple Foodborne Illness Detection Tasks
Hu, Ruofan, Zhang, Dongyu, Tao, Dandan, Hartvigsen, Thomas, Feng, Hao, Rundensteiner, Elke
Foodborne illness is a serious but preventable public health problem -- with delays in detecting the associated outbreaks resulting in productivity loss, expensive recalls, public safety hazards, and even loss of life. While social media is a promising source for identifying unreported foodborne illnesses, there is a dearth of labeled datasets for developing effective outbreak detection models. To accelerate the development of machine learning-based models for foodborne outbreak detection, we thus present TWEET-FID (TWEET-Foodborne Illness Detection), the first publicly available annotated dataset for multiple foodborne illness incident detection tasks. TWEET-FID collected from Twitter is annotated with three facets: tweet class, entity type, and slot type, with labels produced by experts as well as by crowdsource workers. We introduce several domain tasks leveraging these three facets: text relevance classification (TRC), entity mention detection (EMD), and slot filling (SF). We describe the end-to-end methodology for dataset design, creation, and labeling for supporting model development for these tasks. A comprehensive set of results for these tasks leveraging state-of-the-art single- and multi-task deep learning methods on the TWEET-FID dataset are provided. This dataset opens opportunities for future research in foodborne outbreak detection.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- North America > United States > Illinois > Champaign County > Urbana (0.14)
- North America > United States > New York (0.04)
- (7 more...)
Artificial Intelligence and Food Safety: Hype vs. Reality
To understand the promise and peril of artificial intelligence for food safety, consider the story of Larry Brilliant. Brilliant is a self-described "spiritual seeker," "social change addict," and "rock doc." During his medical internship in 1969, he responded to a San Francisco Chronicle columnist's call for medical help to Native Americans then occupying Alcatraz. Then came Warner Bros.' call to have him join the cast of Medicine Ball Caravan, a sort-of sequel to Woodstock Nation. That caravan ultimately led to a detour to India, where Brilliant spent 2 years studying at the foot of the Himalayas in a monastery under guru Neem Karoli Baba. Toward the end of the stay, Karoli Baba informed Brilliant of his calling: join the World Health Organization (WHO) and eradicate smallpox. He joined the WHO as a medical health officer, as a part of a team making over 1 billion house calls collectively. In 1977, he observed the last human with smallpox, leading WHO to declare the disease eradicated. After a decade battling smallpox, Brilliant went on to establish and lead foundations and start-up companies, and serve as a professor of international health at the University of Michigan. As one corporate brand manager wrote, "There are stories that are so incredible that not even the creative minds that fuel Hollywood could write them with a straight face."[1]
- North America > United States > California > San Francisco County > San Francisco (0.24)
- North America > United States > Michigan (0.24)
Weakly Supervised Attention Networks for Fine-Grained Opinion Mining and Public Health
Karamanolakis, Giannis, Hsu, Daniel, Gravano, Luis
In many review classification applications, a fine-grained analysis of the reviews is desirable, because different segments (e.g., sentences) of a review may focus on different aspects of the entity in question. However, training supervised models for segment-level classification requires segment labels, which may be more difficult or expensive to obtain than review labels. In this paper, we employ Multiple Instance Learning (MIL) and use only weak supervision in the form of a single label per review. First, we show that when inappropriate MIL aggregation functions are used, then MIL-based networks are outperformed by simpler baselines. Second, we propose a new aggregation function based on the sigmoid attention mechanism and show that our proposed model outperforms the state-of-the-art models for segment-level sentiment classification (by up to 9.8% in F1). Finally, we highlight the importance of fine-grained predictions in an important public-health application: finding actionable reports of foodborne illness. We show that our model achieves 48.6% higher recall compared to previous models, thus increasing the chance of identifying previously unknown foodborne outbreaks.
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > Nevada (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Research Report > Experimental Study (0.46)
- Research Report > Promising Solution (0.34)
- Health & Medicine > Public Health (0.61)
- Health & Medicine > Epidemiology (0.53)
- Health & Medicine > Therapeutic Area (0.47)
- Food & Agriculture > Food Processing (0.37)
- Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- (4 more...)
AI used to find unsafe foods using consumer product reviews
A new program can track all recalled foods based from Amazon customer reviews. Called BERT, the AI program identified thousands of recalled products with an accuracy rate of 74 percent. Researchers from the Boston University School of Medicine developed an artificial intelligence (AI) program that can detect unsafe food contaminated with chemicals, toxins, pathogens, and those which are mislabeled of allergens. Many people experience illness resulting from the consumption of unsafe food items, which is now considered a global health problem. Because of this, the researchers developed a machine learning approach to help detect reports of unsafe food items from Amazon, a multinational technology company and the world's largest online retailer.
- Retail > Online (0.73)
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.61)
- Health & Medicine > Consumer Health (0.53)
Google algorithm monitors searches to spot restaurants that could give you food poisoning
Google may soon tell you which restaurants could give you food poisoning. The tech giant is working with Harvard University to develop an algorithm that analyzes Google searches to spot which restaurants might have food safety issues. Researchers say it's capable of flagging possible offenders in'near real time.' They created a machine-learning based algorithm to identify unsafe restaurants, training it to look for specific search terms and location data. The model is called FINDER, or Foodborne Illness Detector in Real Time.
- North America > United States > Nevada > Clark County > Las Vegas (0.05)
- North America > United States > Illinois > Cook County > Chicago (0.05)
Google and Harvard develop AI to find restaurants that could make you sick
Worried your go-to hole-in-the-wall might not have a stellar food safety record? Google's new artificially intelligent (AI) system can help lay your fears to rest -- or confirm the worst of them. A study led by researchers at the Mountain View company and Harvard's T.H. Chan School of Public Health describes a machine learning model -- FINDER (Foodborne IllNess DEtector in Real time) -- that leverages search and location data to identify "potentially unsafe" restaurants. Their paper ("Machine-learned epidemiology: real-time detection of foodborne illness at scale") was published today in the journal npj Digital Medicine. "Foodborne illnesses are common, costly, and land thousands of Americans in emergency rooms every year. This new technique, developed by Google, can help restaurants and local health departments find problems more quickly, before they become bigger public health problems," Ashish Jha, K.T. Li Professor of Global Health at Harvard Chan School and director of the Harvard Global Health Institute, said.
- North America > United States > Illinois > Cook County > Chicago (0.08)
- North America > United States > Nevada > Clark County > Las Vegas (0.07)
- Health & Medicine > Epidemiology (1.00)
- Health & Medicine > Consumer Health (1.00)
- Health & Medicine > Public Health (0.95)
- Food & Agriculture > Food Processing (0.87)
Machine-learned epidemiology: real-time detection of foodborne illness at scale
In the 1800s, John Snow had to go door to door during an epidemic of cholera to uncover its mechanisms of spread.1 He recorded where people were getting their drinking water from in order to pinpoint the source of the outbreak. Here we scale up this approach using machine learning to detect potential sources of foodborne illness in real time. Machine learning has become an increasingly common artificial intelligence tool and can be particularly useful when applied to the growing field of syndromic surveillance. Frequently, syndromic surveillance depends upon patients actively reporting symptoms that may signal the presence of a specific disease.2,3
- North America > United States > Illinois > Cook County > Chicago (0.07)
- North America > United States > Nevada > Clark County > Las Vegas (0.06)
- Health & Medicine > Epidemiology (1.00)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.92)
- Health & Medicine > Therapeutic Area > Immunology (0.57)
Machine learning system detects 10 outbreaks of foodborne illness from Yelp reviews
Columbia University and the New York City Department of Health and Mental Hygiene (DOHMH) have developed a machine learning computer system that uses keywords found on Yelp reviews to identify foodborne illnesses and outbreaks. Findings are published in the Journal of the American Medical Informatics Association. Introduced in 2012, a prototype text classifier of the system was developed to determine if a review showed a person experiencing a foodborne illness and to determine if the review indicated multiple foodborne illnesses based on the keywords used in the review. "Effective information extraction regarding foodborne illness from social media is of high importance--online restaurant review sites are popular, and many people are more likely to discuss food poisoning incidents in such sites than on official government channels," said Luis Gravano and Daniel Hsu, coauthors of the study and professors of Computer Science at Columbia Engineering. "Using machine learning has already had a significant impact on the detection of outbreaks of foodborne illnesses."
- Health & Medicine > Epidemiology (1.00)
- Food & Agriculture > Food Processing (1.00)
Deploying nEmesis: Preventing Foodborne Illness by Data Mining Social Media
Sadilek, Adam (University of Rochester) | Kautz, Henry (University of Rochester) | DiPrete, Lauren (Southern Nevada Health District) | Labus, Brian (Southern Nevada Health District, Las Vegas, Nevada) | Portman, Eric (University of Rochester) | Teitel, Jack (University of Rochester) | Silenzio, Vincent (University of Nevada Las Vegas,)
CDC has even identified food safety as one of seven "winnable battles"; however, progress to date has been limited. We show that adaptive inspection process is 64 percent more effective at identifying problematic venues than the current state of the art. If fully deployed, our approach could prevent over 9,000 cases of foodborne illness and 557 hospitalizations annually in Las Vegas alone. Additionally, adaptive inspections result in unexpected benefits, including the identification of venues lacking permits, contagious kitchen staff, and fewer customer complaints filed with the Las Vegas health department.
- Food & Agriculture > Food Processing (0.82)
- Health & Medicine > Epidemiology (0.68)
- Health & Medicine > Consumer Health (0.68)
- Consumer Products & Services > Food, Beverage & Tobacco (0.50)